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ABSTRACT 


This  report  is  the  second  in  a  series  presenting  a  study  of  self- 
correlation  algorithms  in  intelligence  systems.^  It  was  performed  by  the 
Institute  of  Decision  Science  at  ClaremontMcX^na  College  in  support  of 
the  Algorithm  Analysis  subtfsk  of“T:he”TrrS7^^ Army  Intelligence  Center  and 
School  (USAICS),  Software  Ah^ysls  and  Management  System  (USAMS)  task  at 
Jet  Propulsion  Laboratory. Thesa'  self-correiation' 9  algorithms  use 
multivariate  statistical  testa  to  determine  the  equality  of  mean  vectors 
from  two  different  data  sets.  For  example,  tests  are  used  to  determine  the 
equality  of  location  vectors  from  two  different  data  seta  (Are  the  data 
from  the  same  emitter?).  This  report  considers  estimation  of  the 
probability  that  these  tests  may  lead  to  an  Incorrect  decision.  Possible 
test  errors  are  studied  under  different  assumptions  concerning ! 


1)  the  distribution  of  the  data,  e.g.  normal  error,  skewed  error, 
etc.  ' 

.  J 

2)  the  estimated  location  of  the  emitter,  e.g.  mean  of  the  data, 
most  frequent  value,  etc.^ 

3)  the  variability  of  the  error,  i.e.  the  variance-covariance  of  the 
data  I 

4)  the  amount  of  data,  i.e.  sample  size'' 


Frequency  of  test  errors  'ere  estimated  by  simulation  for  most  of  the 
cases  studied.  The  results  indicate  that  in  some  of  the  cases  the  error 
rate  is  high  enough  to  be  of  j  sslble  concern. 
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I.  INTRODUCTION 


Correlation  techniques  for  battlefield  identification  use  various 
tests  for  determining  whether  or  not  a  new  set  of  data  is  from  a  previously 
identified  emitter  location.  These  decision  tests  are  based  on  certain 
assumptions  about  the  data.  Even  if  all  the  assumptions  are  met  the  tests 
have  possible  errors  associated  with  them.  When  some  of  the  assumptions 
concerning  the  data  are  not  met  the  frequency  of  test  errors  may  become 
even  greater,  perhaps  unacceptable. 

This  report  considers 

a.  Types  of  decision  errors  possible  when  performing  statistical 
tests. 

b.  Sensitivity  of  the  frequency  of  test  errors  to  incoming  data 
which  do  not  meet  the  assumptions  of  the  statistical  tests. 


TYPES  OF  DECISION  ERRORS  POSSIBLE  WHEN  PERFORMING  STATISTICAL  TESTS 
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One  of  the  Inherent  problems  with  statistical  tests  is  that  one  cannot 
claim  to  have  made  the  correct  decision  with  con?)lete  certainty.  Two  types 
of  decision  errors  can  be  made: 

I.  Deciding  that  the  new  data  are  not  from  the  previously  identified 
location  when  in  fact  they  are. 

II.  Deciding  that  the  new  data  are  from  the  previously  identified 
location  when  in  fact  they  are  not. 


TRUE  STATE  OF  NATURE 


Same  Location  Different  Location 


Same 

No  Error 

Error  of  the  Second 

TEST 

Location 

Type 

DECISION 

Different 

Error  of  the 

No  Error 

Location 

First  Type 

The  statistical  tests  considered  in  this  report  are  constructed  so 
that  the  probability  of  committing  an  error  of  Type  I  is  set  at  some  level, 
say  0.05.*  If  we  commit  an  error  of  the  first  type  we  would  be  putting  a 
new  emitter  location  erroneously  into  the  data-base.  If  we  commit  an  error 
of  the  second  type  (Type  II)  we  would  be  erroneously  shrinking  the 
confidence  ellipse  vdiich  is  centered  at  the  estimated  location  and  also 
would  not  be  putting  the  new  emitter  into  the  data  base.** 

The  general  diagram  on  page  3  illustrates  errors  in  relation  to 
emitter  locations  and  confidence  ellipses.  The  graph  in  Figure  1  shows  the 
relationship  between  a  variance-covariance  matrix  (sensor  error)  and  the 
resulting  95^  confidence  ellipse.  Figure  2  contains  specific  examples 
which  describe  statistical  decisions  to  relation  to  assumed  emitter 
locations  and  confidence  ellipses.  Additional  graphs  may  be  found  in  the 
body  of  the  report. 
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Often  erroneously  called  a  test  at  the  95!l  level. 

It  could  also  be  possible  that  the  new  data  belong  to 
previously  identified  emitter.  This  error  will  not  be 
here. 


a  different 
illustrated 


FIGURE  2 
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This  report  addresses  some  of  the  statistical  tests  used  in 
correlation  of  battlefield  data.  Based  on  certain  assumptions  (see  Page 
14}  these  tests  allow  one  to  either  accept  or  reject  that  the  new  data 
correspond  to  a  previously  identified  location. 


SENSITIVITY  OF  FREQUENCIES  OF  TEST  ERRORS  TO  NATURE  OF  INCOMING  DATA 

The  statistical  tests  used  in  the  self-correlation  algorithms  were 
designed  for  data  in  which  sensor  error  follows  the  Normal,  or  Gaussian, 
probability  law  with  known  spread  (variance-covariance).  In  addition  these 
tests  assume  that  the  mean  (expected)  value  of  the  data  distribution  is  the 
true  location  of  the  emitter. 


This  report  considers  error  frequency  when: 

1 .  The  underlying  error  distributions  are  not  Normal  but  follow  some 
other  type  of  observed  distribution. 

2.  The  variance-covariance  (spread)  is  not  known  but  is  estimated  or 
is  incorrect. 

3.  The  mean  of  the  data  distribution  is  incorrectly  estimated  (for 
example,  using  the  sample  mode  in  place  of  the  sample  mean  vdien 
the  data  distribution  is  skewed). 


The  relevant  questions  concerning  the  decision  errors  are: 

1 .  Can  such  errors  occur? 

2.  Under  what  conditions  can  decision  errors  occur? 

3.  If  such  errors  can  occur,  then  at  what  frequency  (with  vrtiat 
probability)? 

In  response  to  the  first  question,  errors  of  both  type  I  and  II  can 
occur . 

The  answers  to  the  second  and  third  questions  are  the  subject  of  this 
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II.  CONCLUSION 


Even  when  the  incoming  data  are  normally  distributed  with  known 
variance-covariance,  the  error  rates  in  the  test  decisions  can  cause 
problems  when  the  sample  sizes  of  the  incoming  data  sets  are  small  and  the 
measurement  errors  are  not  small.  When  the  variance-covariance  of  the 
incoming  data  are  estimated  and  not  known,  the  error  rates  in  the  test 
decisions  become  even  larger. 

The  frequency  of  errors  is  less  sensitive  to  changes  in  the  underlying 
distribution  than  it  is  to  changes  in  variance-covariance  values  and 
incorrect  estimation  of  the  mean  of  the  data  distribution. 

Type  I  error  (deciding  the  data  do  not  belong  to  an  indentified 
emitter  location  when  in  fact  they  do)  occurs  most  frequently  when: 

1)  The  mean  of  the  data  distribution  is  Incorrectly  estimated  (for 
excunple,  using  the  sample  mode  in  place  of  the  sample  mean  vdien 
the  data  distribution  is  skewed). 

2)  Ihe  sample  size  is  small  and  the  variance-covariance  is  assumed 
known  but  is  actually  estimated. 

3)  The  variance-covariance  is  assumed  to  have  known  values  but  is 
actually  larger,  possibly  due  to  calibration  error. 

Type  II  error  (deciding  the  data  belong  to  an  identified  emitter 
location  when  they  do  not)  occurs  most  frequently  for: 

1 )  small  sample  sizes 

2)  large  error  spread  (variance-covariance) 

3)  underestimated  error  spread  (assuming  the  variance-covariance  is 
one  set  of  values  when  it  fact  it  is  larger) 

The  errors  discussed  above  are  not  isolated.  As  the  flow  diagrams  in 
the  Introduction  Indicate,  these  errors  can  cause  one  to  update  the  wrong 
confidence  ellipse,  form  a  confidence  ellipse  for  a  phantom  emitter  or  fall 
to  form  a  confidence  ellipse  for  an  existing  emitter. 

The  Initial  description  of  the  problem,  and  hence  the  original 
computer  work,  assumed  four  parameters  in  each  data  observation:  position 
coordinates  plus  two  signal  parameters.  Later,  however,  it  was  discovered 
that  there  are  separate  tests  for  signal  parametrics.  In  light  of  this, 
the  computer  programs  were  modified  to  use  only  the  two  position 
coordinates.  Both  sets  of  results  are  contained  in  the  appendices.  The 
more  recent  bivariate  results  are  similar  to  those  obtained  using  our 
original  4-variate  model.  Thus,  conclusions  drawn  from  the  larger  body  of 
4-variate  simulation  results  may  be  carried  over  to  the  more  realistic 
bivariate  case. 

In  the  next  section  of  this  report  the  error  frequencies  are  discussed 
in  more  detail. 


III.  ERROR  RATES  OF  STATISTICAL' TESTS 


VHien  testing  whether  or  not  the  new  data  are  from  a  previously 
identified  location  certain  assumptions  are  made  concerning  the  new  data. 
In  particular,  the  self -correlation  tests  discussed  in  this  report  were 
designed-  for  data  in  which  the  sensor  error  follows  the  Normal,  or 
Gaussian,  probability  law.  The  goal  of  this  casebook  is  to  study  the 
robustness  of  these  self -correlation  tests  when  these  assumptions  are  not 
met  (the  tests  are  described  in  Appendix  F). 

This  section  will  consider  error  rates  as  a  function  of  underlying 
assumptions  concerning  the  new  data  only.  For  the  purpose  of  determining 
the  robustness  of  the  statistical  tests  under  va.'ying  conditions  it  is 
sufficient  to  assume  that  the  location  of  the  "known"  emitter  is  fixed.  In 
other  words,  it  is  reasonable  to  assume  that  the  location  of  the  emitter  in 
the  data  base  is  known  with  complete  certainty.  VIhy  this  is  so,  and  how 

the  results  are  applied  to  the  actual  self-correlation  process,  is 

discussed  in  section  IV. 

Throughout  this  report  we  assume  that  the  new  data  are  Independent 
observations  from  a  multivariate  distribution  »d»ose  mean  (expected)  value 
is  the  true  location  of  an  emitter.  Sensitivity  studies  are  done  for  the 

tests  for  several  different  error  distributions.  Further,  for  each  of 

these  distributions,  studies  are  performed  with  varying  assumptions 
concerning  the  average  spread  of  the  sensor  error  (assumptions  concerning 
the  variance-covariance  of  the  data). 

Test  error  rates  are  estimated  for  four  different  error  distribution 
types,  which  are  sketched  below. 

1.  Normal  (Gaussian).  A  symmetric  distribution. 

2.  Gillls.  A  symmetric  distribution  with  very  fat  tails. 

3.  Triangular-uniform.  A  symmetric  distribution  with  tails  between 
the  Normal  and  the  Glllis  in  size. 

1|.  Gamma.  A  skewed  (non-symmetric)  distribution.  Note  that  the 
mean  and  the  mode  are  unequal  for  skewed  distributions. 

It  has  been  shown  that  for  skewed  distributions  the  sample  estimate 
used  for  emitter  location  can  be  biased  away  from  the  true  location.  For 
this  casebook  we  attempt  to  simulate  possible  effects  of  the  bias.  For  the 
Gamma  distribution  (the  only  skewed  distribution  in  the  casebook),  two 
different  studies  are  performed.  The  first  study  correctly  uses  the  sample 
mean  as  the  unbiased  estimate  for  emitter  location.  The  second  study 
Improperly  uses  the  sample  mode  (most  likely  value)  as  the  estimate  for 
emitter  location.  The  error  Involved  in  using  the  sample  mode  is  shown 
graphically  in  Figure  3.  below,  for  one  specific  bivariate  case. 


FIGURE  3 


Error  resulting  from  xise  of 
Geunma  Mode  as  location  estimate 


•Ellipse  Centered  At  Mode 


X-flxts  (In  ktlometers) 


In  addition,  for  each  of  these  distributions,  test  error  rates  are 
estimated  for  the  following  four  basic  assumptions  about  data  variance- 
covariance  (the  average  spread  of  the  sensor  error); 

Case  1.  The  variance-covariance  of  the  new  data  is  known. 

Case  2.  The  variance-covariance  is  estimated  from  the  data. 


Case  3< 


Case  4. 


The  variance-covariance  is  assumed  to  be  known,  but  the 
estimated  variance-covariance  is  used,  i.e.  no  estimation 
error  is  assumed.  This  is  sm  incorrect  assumption  when 
small  sample  sizes  are  used  in  the  estimation. 

The  variance-covariance  is  assumed  known  but  is  Incorrect  by 
a  certain  percentage. 


The  statistical  tests  used  in  each  of  these  four  cases  are  detailed  in 
Appendix  F. 

Throughout  all  the  cases  discussed  above,  the  sample  sizes  of  the  new 
data  are  varied  in  order  to  assess  the  test  error  rates  as  a  function  of 
sample  size.  Also  the  magnitude  of  the  variance-covariance  matrix  is 
varied  for  each  case  studied. 

Estimated  test  error  rates  for  different  distributions  and  assumptions 
about  the  variance-covariance  of  the  data  were  obtained  by  simulation. 
From  200  to  500  data  sets  were  simulated  for  each  distribution,  assumed 
variance-covariance,  sample  size  and  location.  Simulation  methods  are 
detailed  in  Appendix  G. 

The  following  table  summarizes  the  results  of  the  simulations.  This 
summary  compares  the  errors  obtained  by  varying  the  assumptions 
(distributions,  variance-covariance)  to  those  errors  obtained  vrtien  all 
assumptions  underlying  the  tests  are  met.  Even  if  all  assumptions 
underlying  the  statistical  tests  are  not  met  the  tests  may  be  robust  with 
respect  to  these  unmet  assumptions,  i.e.  they  may  yield  errors  comparable 
to  those  obtained  vdien  all  the  assumptions  are  met. 

Table  1  contains  classification  as  to  Robust  or  Not  Robust  for  those 
distributions  and  variance-covariance  cases  simulated.  If  Type  I  and/or 
Type  II  errors  are  greater  than  those  expected  under  the  correct 
assumptions  then  the  cases  are  classified  as  Not  Robust.  For  further 
details  see  the  simulation  results  listed  under  each  distribution  type. 
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TABLE  1 


Caae 

Normal 

Clllia 

Triangular- 

uniform 

Gemma* 

Gamma  Mode** 

(1) 

Variance-covariance 

known 

Aaaumptiona 

met 

Robuat 

Not  Robuat 

Not  Robuat 

Not  Robuat 

(2) 

Variance-covariance 

eatimated 

Aaaumptiona 

met 

Non  Robusn 

Not  Robuat 

Not  Robuat 

Not  Robuat 

(3) 

Variance-covariance 
aaaumed  known  but 
eatimated 

Not  Robuat 

Not  Robuat 

Not  Robuat 

Not  Robuat 

Not  Robuat 

(4) 

Variance-covariance 
aaaumed  known  but 
in  error  by  a 
percentage. 

Not  Robuat 

Not  Robuat 

Not  Robuat 

Not  Robuat 

Not  Robuat 

■  TTieaa  Caoaia  dlatrlbutlona  are  Oaima  dlatributlona  where  the  aaople  mean  la  uaed  to 
eattmate  the  emitter  location  or  paraoetrica. 

**  Theae  Gamma  dlatrlbutions  are  Gamma  dlatributlona  where  the  aample  mode  (mat  frequent 
data  valuea)  are  uaed  to  eatlmate  the  emitter  location  or  parametrica. 


All  studies  were  performed  by  setting  the  test  probability  of  type  I 
error  (probability  of  deciding  the  new  data  are  not  from  the  identified 
location  vrtien  in  fact  they  are)  at  0.05. 

The  following  five  tables  contain  summary  results  by  distribution. 
For  each  variance -covariance  case,  probabilities  of  type  1  and  type  II 
errors  are  discussed.  More  complete  results  are  given  in  the  appendicies. 


17 


,v 


,V. 

i 


TABLE  2:  NORMAL  DISTRIBUTION 


PROBABILITY  OF 

PROBABILITY  OF 

■ 

CASE 

TYPE  I 

TYPE  II 

ERROR 

ERROR 

■ 

(1) 

Variance-covariance 

Set  at  0.05 

Varies  between  0.90 

known 

and  0  depending  on 

» . 

sample  size,  assumed 

difference  in  location 

V 

of  emitters  and  mag¬ 
nitude  of  variance- 

■? 

covariance 

(2) 

Variance-covariance 

Set  At  0.05 

Varies  between  0.94 

estimated  from  data 

and  0  depending  on 
sample  size,  assumed 

difference  in  location 

of  emitters  and  mag¬ 
nitude  of  variance- 

covariance 

a 

V 

(3) 

Variance-covariance 

Varies  between  0.77 

Varies  between  0.52 

assumed  known  but 

and  0.05  depending 

and  0. 

estimated 

on  sample  size. 

Larger  error  rates  occur 

i 

Larger  error  rates 
occur  for  4-variate 
data 

for  bivariate  data 

m 

(4) 

Variance-covariance 

Varies  between  .22 

Varies  between  0.97 

assumed  known  but 

and  0  depending 

and  0  depending  on 

, 

in  error  by  a 

on  the  magnitude  of 

sample  size,  assumed 

percentage 

variance-covariance 

difference  in  location 

error 

of  emitters  and  mag¬ 
nitude  of  variance- 

covariance  error 

In  all  cases  as  either  the  sample  size 
increases  or  the  distance  between  the 
emitters  increases,  the  probability  of 
Type  II  error  goes  to  0. 


TABLE  3:  GILLIS  DISTRIBUTION 


CASE 

PROBABILITY  OF 

TYPE  I 

ERROR 

PROBABILITY  OF 

TYPE  II 

ERROR 

(1) 

Variance-covariance 

known 

Agrees  with  the  set 
level  of  0.05 

Varies  between  0.71 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance 

(2) 

Variance-covariance 
estimated  from  data 

Varies  between  0.12 
and  0.03 

Varies  between  0.94 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance 

(3) 

Var iance-covar iance 
assumed  known  but 
estimated 

Varies  between  0.70 
and  0.04  depending 
on  sample  size. 

Error  rates  are 
higher  for  4-varlate 
data 

Varies  between  0.68 
and  0. 

Error  rates  are  higher 
for  bivariate  data 

(4) 

Variance-covariance 
assumed  known  but 
in  error  by  a 
percentage 

Varies  between  0.24 
and  0  depending 
on  the  oiagnltude  of 
variance-covariance 
error 

Varies  between  0.92 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance  error 

In  all  cases  as  either  the  sample  size 
increases  or  the  distance  between  the 
emitters  Increases,  the  probability  of 
Type  II  error  goes  to  0. 
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TABLE  i*;  TRIANGULAR-UNIFORM  DISTRIBUTION 


CASE 

PROBABILITY  OF 

TYPE  I 

ERROR 

PROBABILITY  OF 

TYPE  II 

ERROR 

(1) 

Variance-covariance 

known 

Agrees  with  the  set 
level  of  0.05 

Varies  between  0.85 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance* 

(2) 

Variance-covariance 
estimated  from  data 

Agrees  with  the  set 
level  of  0.05 

Varies  between  0.95 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance* 

(3) 

Variance-covariance 
assumed  known  but 
estimated 

Varies  between  0.75 
and  0.05  depending 
on  sample  size. 

Higher  error  rates 
occur  for  4-variate 
data 

Varies  between  0.65 
and  0. 

Higher  error  rates  occur 
for  bivariate  data 

(M) 

Variance-covariance 
assumed  known  but 
in  error  by  a 
percentage 

Varies  between  0.24 
and  0  depending 
on  the  magnitude  of 
var iance-covar iance 
error 

Varies  between  0.94 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance  error 

In  all  cases  as  either  the  sample  size  increases  or  the  distance  between 
the  emitters  increases,  the  probability  of  Type  II  error  goes  to  0. 


Error  is  greater  than  when  the  distribution  is  Normal 


sc 


TABLE  5:  GAMMA  DISTRIBUTION 


CASE 

PROBABILITY  OF 

TYPE  I 

ERROR 

PROBABILITY  OF 

TYPE  II 

ERROR 

(1) 

Variance-covariance 

known 

Agrees  with  the  set 
level  of  0.05 

Varies  between  0.76 
and  0  depending  on 
sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance* 

(2) 

Variance-covariance 
estimated  from  data 

Varies  from  0.12  to 
0.03. 

Small  sample  sizes 
for  bivariate  data 
have  the  highest 
error  rate 

Varies  between  0.92 
and  0  depending  mainly 
on  sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance* 

(3) 

Variance-covariance 
assumed  known  but 
estimated 

Varies  between  0.80 
and  0.06  depending 
on  sample  size. 

Higher  error  rates 
occur  for  4-varlate 
data 

Varies  between  0.42 
and  0  depending  mainly 
on  sample  size,  assumed 
difference  in  location 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance 

(»») 

Var iance-covar iance 
assumed  known  but 
in  error  by  a 
percentage 

Varies  between  0.24 
and  0.02  depending 
on  the  magnitude  of 
variance-covariance 

error 

Varies  between  0.96 
and  0  depending  on 
sample  size,  assumed 
difference  in  location' 
of  emitters  and  mag¬ 
nitude  of  variance- 
covariance  error 

In  all  cases  as  either  the  sample  size  increases  or  the  distance  between 
the  emitters  increases,  the  probability  of  Type  II  error  goes  to  0. 


*  Error  is  greater  than  when  the  distribution  is  Normal 


CASE 

PROBABILITY  OF 

TYPE  I 

ERROR 

PROBABILITY  OF 

TYPE  II 

ERROR 

(1) 

Variance-covariance 

known 

Varies  between  1.0 
and  0.88  depending 
on  sample  size.  The 
larger  the  sample 
size  the  larger  the 
probability  of 
error 

Almost  0 

(2) 

Var lane  e-covar lance 
estimated  from  data 

Varies  between  1.0 
and  0.09  depending 
on  sample  size.  The 
larger  the  sample 
size  the  larger  the 
probability  of 
error 

Varies  between  0.88 
for  small  sample  size 
and  0  for  larger  saD$)le 
size.  Also  depends  on 
assumed  difference  In 
location  of  emitters* 

(3) 

Var lane e-c ovar lance 
assumed  known  but 
estimated 

Varies  between  1.0 
and  0.93  depending 
mainly  on  sample 
size.  The  larger 
the  sample  size  the 
larger  the  proba¬ 
bility  of  error 

Almost  0 

(M) 

Variance-covariance 
assumed  known  but 

In  error  by  a 
percentage 

Varies  between  1.0 
and  0.76  depending 
on  sample  size  and 
magnitude  of  the 
var 1 anc  e-covar lance 

error 

From  0.24  to  0. 

In  all  cases  the  probability  of  Type  I  error  Increases  as  the  sample  size 
Increases. 


*  Error  Is  greater  than  when  the  distribution  Is  Normal 


IV.  CONFIDENCE  ELLIPSE  CONSIDERATIONS 


In  practice,  "knovm”  emitter  locations  are  stored  in  a  database  as  a 
location  estimate  and  a  confidence  ellipse.  That  is,  vihile  the  emitter  Is 
"known"  to  the  analysis  center,  its  actual  position  is  uncertain.  The 
self-correlation  statistical  tests  take  this  into  account.  However,  for 
the  purpose  of  this  analysis  it  is  reasonable  to  assume  that  the  location 
of  the  "known"  emitter  is  in  fact  known  with  certainty.  The  results 
obtained  under  this  assumption  may  be  applied  to  the  actual  situation  with 
complete  generality.  This  section  will  provide  a  basis  for  this  statement, 
and  explain  how  to  apply  the  results  to  the  more  complex  (two  ellipse) 
case,  which  is  illustrated  in  Figure  2  in  the  Introduction.  Additional 
ellipse  considerations  are  the  effect  of  sample  size  and  the  effect  of 
inaccurate  covariance  matrices. 


1 .  Two  Ellipse  Case 

The  statistical  test  uses  two  location  estimates  and  a  covariance 
matrix  for  each.  However,  a  comprehensive  study  involving  many  pairs  of 
covariance  matrices  would  have  too  many  cases  to  report.  Consequently,  it 
is  important  to  find  classes  of  cases  that  yield  the  same  result.  Here  is 
an  example  of  one  such  class.  The  following  three  pairs  of  covariance 
matrices  yield  the  same  value  of  the  test  statistic  for  any  given  pair  of 
location  estimates  (Note  that  the  following  arguments  hold  for  4-variate 
covariance  matrices  as  well). 


Covariance  for 
Estimate  1 


Covariance  for 
Estimate  2 


Sum  of 
Matrices 


[::]  [:;]  [::] 

1 3  [: :]  [: :] 

•'  [: :]  [: :]  C :] 


These  three  pairs  yield  the  same  value  of  the  test  statistic  because 
the  formula  is  based  on  the  sum  of  the  covariance  matrices.  Since  each 
pair  of  matrices  above  sum  to  the  same  matrix,  each  is  equivalent  where  the 


statistical  test  Is  concerned.  The  third  pair  is  different  from  the  first 
two,  however,  in  that  there  is  no  uncertainty  in  estimate  1.  This  is  the 
same  as  knowing  the  precise  location  of  one  emitter.  Note  that  this 
special  case  may  easily  be  found  for  any  general  two  ellipse  case  by  simply 
taking  the  sum  of  the  covariances  of  the  two  ellipses  and  using  the  sum  in 
the  statistical  test. 

The  statistical  test  cannot  differentiate  between  any  of  the  three 
pairs  of  covariance  matrices  listed  above.  Consequently,  it  is  clear  that 
by  examining  the  special  case  above  one  produces  results  which  apply  to  the 
first  two  pairs  of  matrices.  Indeed,  the  results  will  apply  to  any  pair  of 
covariance  matrices  which  sum  to  the  second  matrix  in  Pair  3«  Further,  the 
results  will  not  vary  between  the  cases  because  of  skewness,  kurtosis,  bias 
and  so  on.  though  uncertainty  in  the  covariance  matrix  may  have  an  effect. 

Appendices  A  through  E  contain  the  results  of  the  simulations  in 
tabular  form.  At  the  top  of  each  table  is  the  "special  case"  matrix  which 
was  used  to  simulate  the  data.  By  the  reasoning  above,  this  table  also 
applies  to  any  two  ellipse  case  in  which  the  covariance  matrices  sum  to  the 
matrix  shown  on  the  table.  Note  that  this  "special  case"  matrix  is  also 
referred  in  this  report  as  the  composite  matrix. 

The  test  used  in  Cases  2  and  3  is  slightly  different.  In  it,  the 
formula  for  the  test  statistic  uses  the  sum  of  two  estimated  covariance 
matrices,  and  the  correct  teat  statistic  is  based  on  the  F  distribution. 
Vfhile  the  results  for  Cases  2  and  3  may  also  be  generalized  in  the  manner 
shown  above,  the  process  itself  is  somewhat  different.  It  is  also  slightly 
more  complex,  and  since  we  have  not  encountered  the  F-test  in  practice  we 
will  not  explain  it  here.  Suffice  it  to  say  that  the  results  listed  in  the 
appendices  for  Cases  2  and  3  are  an  accurate  assessment  of  the  robustness 
of  the  F-test. 


2.  Effect  of  Sample  Size  on  Covariance  Matrices 

In  the  simulations  run  for  this  casebook,  the  data  were  generated  as 
observations  from  a  number  of  different  multivariate  distributions,  such  as 
the  Normal  and  the  Gamma.  The  statistical  tests  assume  that  as  the  sample 
size  increases  our  uncertainty  as  to  the  emitter  location  (size  of  the 
covariance  matrix)  decreases.  In  the  simple  or  "special"  case,  typified  by 
"Pair  3"  in  the  example  in  part  1  of  this  section,  if  the  covariance  matrix 
is  the  known  (or  estimated)  error  owing  to  sensor  Inaccuracy  then  the 
statistical  test  uses  this  covariance  matrix  divided  by  the  appropriate 
sample  size,  n.  For  example,  if  the  sensor  covariance  matrix  is 


and  sample  size  n  -  2,  then  the  matrix  used  in  summing  the  composite  matrix 
is 


I 


Fact:  Two  Emitters 
Decision:  One  Emitter 
(Type  II  Error) 


(In  MIo— tT«> 


ir 


Fact:  Two  Emitters 
Decision:  Two  Emitters 
(Correct  Decision) 
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In  order  to  study  the  effect  of  sample  size  on  the  accuracy  of  the 
statistical  tests  we  bring  sample  size  into  the  calculations  by  simply 
dividing  it  into  the  composite  matrix.  In  practice  there  would  be  two 
sample  sizes;  one  for  the  estimate  in  the  database  and  one  for  the  new 
data.  Time  does  not  permit  considering  all  possible  cases.  Therefore  the 
effect  of  saiq>le  size  is  studied  by  simply  considering  the  frequency  of 
test  errors  where  the  composite  matrix  given  at  the  top  of  each  table  (in 
the  Appendices)  is  divided  by  the  sample  size  noted  in  the  table.  In  all 
of  the  cases  shown,  the  test  matrix  is  quite  small  when  the  sample  size 
Increases  to  100,  i.e.  1/100  of  the  original  covariance  matrix. 

It  should  be  noted  that  the  above  discussion  of  sample  size  depends  on 
the  classic  statistical  methods  of  estimation.  Sample  size  as  used  here  is 
not  directly  applicable  to  the  methods  used  in  the  actual  fix  algorithms  we 
have  studied.  That  is,  it  is  not  a  well*defined  concept  in  the  context  of 
position-fixing  algorithms.  However,  this  does  not  mean  that  it  is  not  a 
useful  concept  in  this  setting.  The  sample  size  used  in  the  Appendices  may 
be  thought  of  as  measuring  the  amount  or  goodness  of  the  Information  that 
was  actually  used  in  calculating  the  position  estimate. 
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3.  The  Effect  of  Inaccurate  Covariance  Matrices 

Though  the  following  issue  is  not  directly  related  to  the  concerns 
in  the  preceding  paragraphs,  it  will  be  discussed  here  because  it  Involves 
the  composite  covariance  matrix  used  in  the  chi-square  statistical  test. 
This  issue  is  the  error  that  can  occur  when  the  data  covariance  matrices 
used  in  calculating  the  cooqposite  matrix  are  not  the  true  covariance 
matrices.  In  this  analysis  (represented  by  Case  4  in  the  appendices)  the 
simplifying  assumption  of  one  emitter  location  known  will  be  used. 
Further,  the  Incorrect  covariance  matrix  used  will  only  be  off  by  a  small 
percentage  of  the  correct  covariance  matrix.  The  purpose  of  this  is  to 
show  the  effect  of  small  errors.  The  covariance  matrix  was  varied  by 
amounts  ranging  from  -^HTTo  tt%. 

Figure  5  shows  two  of  the  possible  errors  resulting  from  the  incorrect 
covariance  matrices.  In  both  of  the  cases  shown,  an  incorrect  decision  was 
made  because  the  confidence  ellipse  was  too  large  or  too  small.  If  the 
correct  matrix  had  been  used,  the  correct  decision  would  have  been  made. 
Note  that  the  matrices  used  in  Figure  5  were  increased  or  decreased  by  20%. 
Such  a  large  factor  was  used  so  that  a  clear  graph  could  be  produced.  In 
fact,  an  Increase  or  decrease  of  as  little  as  1%  or  2%  is  enough  to  cause  a 
significant  number  of  incorrect  decisions. 


Li 
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FIGURE  5 


Fact:  One  Ellipse 
Decision:  Two  Ellipse 
(Due  to  decrease  in  size) 


CASE  4 

Errors  resulting  from 
incorrect  Covariance 
Matrices. 


Error  Variance 


— Reduced  Ellipse  (-20%) 
Correct  Ellipse 


U  lo-3  0.3J 

(Comer  Ellipse) 


Fact:  Two  Ellipses 

Decision:  One  Ellipse 

(Due  to  increase) 


®  Known  Emitter 
O  Unlcnown  Emitter 


O  . 


•  Location  Estimate 


— Enlarged  Ellipse  (+20%) 
Correct  Ellipse 


X-em«  tin  kilo— fr«l 
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APPENDIX  A 


Normal  Distribution 


The  Normal  Distribution  is  the  underlying  distribution  assumed  for  the 
data  In  the  tests  studied.  The  Normal  distribution  is  often  assumed  to  be 
the  distribution  of  equipment  measuring  error. 

Sample  sizes  vary  from  5  to  50.  The  tables  which  follow  are 
categorized  by  the  four  basic  assumptions  about  the  data  variance- 
covariance  (see  Page  14  ).  Both  4-variate  and  bivariate  cases  are  listed. 
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Normal  Distribution,  Case  1 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  for  the  distribution  is  known,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

All  assumptions  are  met. 


Summary 


Probability  of  Type  I  error: 
Set  at  0.05 


Probability  of  Type  II  error; 

Varies  between  0.90  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  variance-covariance. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  A-1 


10  0  0 
0  10  0 
0  0  10 
pool 

CASE  1 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 
_ _ Sample  Size 


5 

10 

20 

50 

100 

o 

• 

o 

96 

93 

95 

95 

- 

0.5 

60 

29 

5 

0 

- 

1.0 

5 

0 

0 

0 

- 

1.5 

0 

0 

0 

0 

- 

2.0 

0 

0 

0 

0 

- 

2.5 

0 

0 

0 

0 

- 

3.0 

0 

0 

0 

0 

- 

When  the  difference  in  location  is  0  the  estimated 
probability  of  Type  I  error  is 

,  Percentage 
100 


Thus  for  Sample  Size  10  the  estimated  probability  of 
Type  I  error  is 


1 


0.07. 


The  estimated  probability  of  Type  II  is  simply 
the 


Percentage 

100 


Thus  for  Sample  Size  10  with  a  difference  in  location 
of  emitters  of  0.5  the  probability  of  Type  II  error  is 


0.29. 


When  the  difference  in  location  is  given  as  .5  this  means 
that  the  difference  is  .5  in  each  of  the  "directions”. 


DIFFERENCE  IN  I/DCATION  OF  EMITTERS 


TABLE  A- 2 


E3  1  1 

_  1  2  .3 

“  1  .3  4 

.5  .2 


CASE  1 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SaiUTDle  Size 


5 

10 

20 

50 

95 

95 

95 

94 

90 

77 

61 

23 

62 

28 

5 

0 

24 

1 

0 

0 

4 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 
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TABLE  A- 3 


I 


1  0  0  0“ 

0  10  0 

0  0  10 

0  0  0  1 
m 


I 


CASE  1 


NORMAL 


N* 


i 


t-: 


cn  PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

oi 

a 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


Normal  Distribution,  Case  2 


The  test  assumes  the  underlying  distribution  Is  Normal,  that  the 
variance-covariance  matrix  Is  estimated  from  the  data,  and  that  the 
location  Is  estimated  by  the  mean  of  the  data. 

All  assumptions  are  met. 


Summarv 


Probability  of  Type  I  error: 
Set  at  0.05 


Probability  of  Type  II  error: 

Varies  between  0.9^  and  0  depending  on  sample  size,  assumed  difference 
In  location  of  emitters  and  magnitude  of  variance-covariance. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


^BLE  A-  6 


10  0 


0  10 
0  0  1 


CASE  2 


0  0  0 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samisle  Size 


5 

10 

20 

50 

96 

94 

94 

95 

90 

65 

9 

0 

90 

7 

0 

0 

84 

0 

0 

0 

76 

0 

0 

0 

74 

1  ^ 

0 

0 

66 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  *  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  error 
when  difference  in  location  is  not  0. 


Percentaqe 

100 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  A-  7 


3  11. 

1  2  .3  . 

1.3  4 

5.2  1 


c. 


2 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size  _ 


5 

10 

20 

50 

100 

HI 

96 

95 

96 

B 

- 

B 

94 

88 

67 

- 

91 

62 

8 

- 

B 

87 

30 

2 

WM 

- 

2.0 

90 

6 

0 

0 

- 

in 

• 

84 

0 

0 

- 

3.0 

82 

0 

0 

- 

piffsrence  in  location  is  the  seune  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  »  1-  — 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  * 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


CASE 


TABLE  A- 8 


2-Bivariate 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 

I  I  P  i 


5 

10 

20 

50 

100 

IB 

92 

95 

98 

97 

96 

B 

84 

67 

26 

0 

0 

62 

14 

0 

0 

0 

B 

35 

0 

0 

0 

0 

2.0 

12 

0 

0 

0 

0 

2.5 

2 

0 

0 

0 

0 

3.0 

1 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 
Estimated  probability  of  Type  I  Error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error  «  Percentage 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  A-  9 


5  2 
2  1 


CASE  2-Bivariate 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Seunple  Size 


10 

20 

50 

100 

99 

96 

96 

92 

56 

24 

0 

0 

8 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  San^le  Size  5 
Estimated  probability  of  Type  I  Error  *  1- 
when  difference  In  location  is  0. 


Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0. 


Percentage 


Normal  Distribution,  Case  3 


The  test  assumes  the  underlying  distribution  Is  Normal,  that  the 
variance-covariance  matrix  Is  known,  and  the  location  la  estimated  by  the 
mean  of  the  data. 

The  first  and  third  assumptions  are  met.  Itowever,  the  variance- 
covariance  matrix  used  In  the  test  Is  estimated  from  the  data. 


Summary 


Probability  of  Type  I  error; 

Varies  between  0.77  and  0.05  depending  on  sample  size.  Larger  error 
rates  occur  for  iJ-varlate  data. 

Probability  of  Type  II  error; 

Varies  between  0.52  and  0.  Larger  error  rates  occur  for  bivariate 


data 


TABLE  A-10 


10  0 


0  10 

^001 


CASE  3 


0  0  0 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samcle  Size 


20 

50 

86 

95 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  »  i. 
when  difference  in  location  is  0. 


Percentage 


Estimated  probability  of  Type  II  error 
when  difference  in  location  is  not  0. 


Percentage 
- T60 


V*  I 

O*  I 


I.I’AI’VW 


DIFFliRENCE  IN  LOCATION  OF  EMITTERS 


TABLE  A-11 


3  11 

12.3 

1.3  4 

5.2  1 


CASE  3 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size  _ 


5 

10 

20 

50 

100 

0.0 

24 

76 

91 

94 

- 

0.5 

18 

59 

45 

17 

- 

1.0 

10 

10 

4 

0 

- 

1.5 

3 

1 

0 

0 

- 

2.0 

2 

0 

0 

0 

- 

2.5 

0 

0 

0 

0 

- 

3.0 

0 

0 

0 

0 

- 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  *  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


Percentage 

100 


TABLE  A  -  12 


CASE  3-Bivariate 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samcle  Size 


5 

10 

20 

0.0 

75 

87 

89 

in 

• 

o 

48 

38 

20 

1.0 

14 

3 

0 

1.5 

2 

0 

0 

o 

t 

0 

0 

0 

2.5 

0 

0 

0 

3.0 

0 

0 

1  0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Seunple  Size  5 
Estimated  probability  of  Type  I  Error  =  1-  ~ 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  Error  = 
when  difference  in  location  is  not 


Percent aae 

TUO 


TABLE  A-13 


E- 


5  2 
2  1 


CASE  3-Bivariate 


NORMAL 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 


Estimated  probability  of  Type  I  Error  *  1- 

when  difference  in  location  is  0 . 

Estimated  probability  of  Type  II  Error  *  — 
when  difference  in  location  is  not  0 . 


Percentage 


Percentage 

Too 


5 


Normal  Distribution,  Case  ^ 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  knovm,  and  that  the  location  is  estimated  by 
the  mean  of  the  data. 

The  first  and  third  assumptions  are  met.  However,  the  variance- 
covariance  matrix  used  in  the  test  is  varied  from  the  true  by  a  percentage 
to  4?).  This  could  be  thought  of  as  an  effect  of  calibration  error. 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.22  and  0  depending  on  the  magnitude  of  the 
variance-covariance  error. 


Probability  of  Type  II  error: 


Varies  between  0.97  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  the  variance-covariance  error. 


Change  in  Covaraince  Matrix 
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TABLE  A- 14 


CASE  4 


10  0  0 
0  10  0 
0  0  10 
0  0  0  1 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


%  Change 

Diff.  in  Log. 

Of.  Emitter 

5 

10 

20 

50 

100 

0 

83 

80 

79 

77 

- 

0.5 

38 

10 

0.5 

0 

- 

-4  % 

1.0 

0.5 

0 

0 

0 

- 

2.0 

0 

0 

0 

0 

0 

86 

85 

83 

85 

- 

0.5 

40 

10 

2 

0 

- 

1.0 

2 

0 

0 

0 

- 

2.0 

0 

0 

0 

0 

- 

0 

86 

89 

84 

92 

- 

-2% 

0.5 

1.0 

38 

2 

21 

0 

2 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

98 

MSSM 

98 

98 

- 

0  a 

0.5 

74 

3 

0 

- 

1.0 

10 

0 

0 

- 

2.0 

0 

0 

0 

0 

- 

0 

97 

99 

98 

98 

- 

0.5 

78 

46 

10 

0 

- 

3% 

1.0 

10 

0 

0 

0 

- 

2.0 

0 

0 

0 

0 

— 

0 

99 

100 

99 

— 

0.5 

78 

44 

8 

0 

— 

4% 

1.0 

0 

0 

0 

- 

2.0 

0 

0 

0 

0 

— 

TABLE  A- 15 


CASE  4 


NORMAL 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


%  Change 


Diff.  in  Loc 
Of.  Emitter 


X 

•H 

I 

S 

0) 

V 

o 

c 

B 

•H 

10 

cd 

u 

(0 

!  rV 

> 

1 

0 

A- 

O 

c 

1  1 

2  .3 

3  4 


Estimated  probability  of  Type  I  error  =  i 
Estimated  probability  of  Type  II  error  = 


For  0%  change  see  Table  A-2  results. 


Percentage 

100 

Percentage 

100 


'A 


Change  in  Covariance  Matrix 


TABLE  A- 16 


CASE  4 -Bivariate 


1  0 
1 


NORMAL 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


%  Change 


Sample  Size 


Oiff.  in  Loc 
Of.  Emitter 


99 

98 

96 

98 

100 

82 

66 

36 

2 

0 

36 

4 

0 

0 

0 

0 

0 

0 

0 

0 

Estimated -probability  of  Type  I  error  =  1- 
Estimated  probability  of  Type  II  error  =  - 


Percentaae 

100 

Percentaae 

100 


APPENDIX  B 


Gillis  Distribution 


In  the  simulation  results  which  follow,  the  test  distribution  is  the 
Gillis  distribution. 

T^e  Gillis  distribution  is  a  very  fat-tailed  distribution  based  on  the 
cosine  function. 


The  standard  univariate  Gillis  sketched  above  has  the  following  density 
function. 


[cos(x) 
cos(ir- 


)/2  0  S  X  S  ir/2 

-x)/2  »/2  S  X  S  IT 


For  the  H-variate  cases,  the  data  are  vectors  with  four  components, 
each  of  »*ich  is  an  independent  Gillis  random  variable.  For  the  bivariate 
case,  the  data  are  ordered  pairs,  the  two  components  of  which  may  be 
dependent. 

Sample  sizes  vary  from  5  to  100.  The  tables  which  follow  are 
categorized  by  the  four  basic  assumptions  about  the  data  variance- 
covariance  (see  Page  14). 


50 


i 


I 


r-'» 


S.\ 


Gillis  Distribution,  Case  1 


The  test  assumes  the  underlying  distribution  is  Normal, 
varlance“Covar lance  matrix  for  the  distribution  Is  known,  and 
location  Is  estimated  by  the  mean  of  the  data. 

The  first  assumption  Is  not  met.  The  data  follow 
distribution,  not  the  Normal  distribution.  However,  the  second 
assumptions  are  met. 


Summary 


Probability  of  Type  I  error: 

Agrees  with  the  set  level  of  0.05. 


Probability  of  Type  II  error: 

Varies  between  0.71  and  0  depending  on  sample  size,  assumed 
in  location  of  emitters  and  magnitude  of  variance-covariance. 


that  the 
that  the 


a  Gillis 
and  third 


difference 


Glllis  Distribution,  Case  2 


The  test  assumes  the  underlying  distribution  Is  Normal,  that  the 
variance -CO variance  matrix  Is  estimated  from  the  data,  and  that  the 
location  Is  estimated  by  the  mean  of  the  data. 

The  first  assumption  Is  not  met.  The  data  follow  a  Glllis 
distribution,  not  the  Normal  distribution.  However,  the  second  and  third 
assumptions  are  met. 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.12  and  0.03. 


Probability  of  Type  II  error: 


I*; 


20 

50 

100 

92 

92 

B 

16 

0 

B 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  B-4 


10  0 
0  2  0 
0  0  3 
0  0  0 


CASE  2 


GILLIS 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SaiRDle  Size 


20 

50 

96 

96 

52 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probsdiility  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  =  Percentage 

100 

when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  B- 5 


CASE 


0 

1 


2-Bi variate 


GILLIS 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


5 

10 

20 

50 

100 

88 

96 

98 

97 

97 

B 

86 

67 

30 

0 

0 

75 

4 

0 

0 

0 

1.5 

48 

0 

0 

0 

0 

to 

• 

o 

22 

0 

0 

0 

0 

2.5 

4 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5. 
Estimated  probability  of  Type  I  Error  =  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0 . 


Percentage 

Too 
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Glllls  Distribution,  Case  3 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  the  location  is  estimated  by  the 
mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  a  Gillis 
distribution,  not  the  Normal  distribution,  and  the  variance-covariance 
matrix  used  in  the  test  is  estimated  from  the  data,  not  known.  However, 
the  third  assumption  is  met. 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.70  and  0.04  depending  on  sample  size.  Error  rates 
are  higher  for  4-variate  data. 

Probability  of  Type  II  error: 

Varies  between  0.68  and  0.  Error  rates  are  higher  for  bivariate  data. 


DirFERENCB  IN  IXDCATION  OF  EMITTERS 


TABLE  B-7 


0  0| 
0  0 
1  0 
0  1 


CASE  3 


GILLIS 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SaiTiDle  Size 


5 

10 

20 

50 

100 

o 

• 

o 

30 

70 

87 

94 

96 

in 

• 

o 

15 

17 

4 

0 

0 

1.0 

1 

0.5 

0 

0 

0 

1.5 

0 

0 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 


Percentaae 

loo 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


Percentaae 

100 
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DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  B-8 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  B-9 


CASE  3-Bivariate 


GILLIS 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


20 

50 

100 

92 

94 

94 

18 

0 

0 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 
Estimated  probability  of  Type  I  Error  =  1- 


when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0 . 


Percentage 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  B-  10 


CASE 


2 

1 

. 


-Bivariate 


GILLIS 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Samtsle  Size 


5 

10 

20 

50 

100 

0.0 

78 

88 

89 

96 

90 

0.5 

65 

68 

48 

8 

0 

1.0 

39 

18 

2 

0 

0 

1.5 

12 

0 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5. 
Estimated  probability  of  Type  I  Error  *  1- 


when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error  = 
when  difference  in  location  is  not  0 . 


Percentaae 


/ 


Glllls  Distribution,  Case  4 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance -covariance  matrix  is  known,  and  that  the  location  is  estimated  by 
the  mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  the  Gillis 
distribution,  not  the  Normal  distribution,  and  the  variance-covariance 
matrix  used  in  the  test  is  varied  from  the  true  by  a  percentage  (-4it  to 
4J).  This  could  be  thought  of  as  an  effect  of  calibration  error.  However, 
the  third  assumption  is  met. 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.24  and  0  depending  on  the  magnitude  of  the  variance- 
covariance  error. 


Probability  of  Type  II  error: 

Varies  between  0.92  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  the  variance-covariance  error. 
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TABLE  B-11 


CASE  4 


E- 


.  10  0  0 

_  0  1  0  0 

0  0  10 

)  0  0  0  1 


GILLIS 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


%  Change 


Oiff.  in  Log. 
Of.  Emitter 


Sample  Size 


3% 

0 

0.5 

1.0 

2.0 

4% 

0 

0.5 

1.0 

2.0 

Estimated  probability  of  Type  I  error  =  1- 
Estimated  probability  of  Type  II  error  =  Percent^e 
For  0%  change,  see  Table  B-1  results. 


TABLE  B-12 


CASE  4 


E- 


10  0  0 
0  2  0  0 
0  0  3  0 
0  0  0  4 


GILLIS 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


Diff.  in  Log. 
%  Change  Of.  Emitter 


Estimated  probability  of  Type  I  error  =  1-  ^^loo 

P62rc3n^&cr@ 

Estimated  probability  of  Type  II  error  =  - — 


O  O  O  VO  o  o 
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TABLE  B-13 


i 


CASE  4-Bivariate 


E- 


I 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


67 


I 

I 


APPENDIX  C 

Triangular-uniform  Distribution 


In  the  simulation  results  which  follow,  the  test  distribution  is  the 
Triangular-uniform  distribution. 


The  Triangular-uniform  is  a  medium  fat-tailed  distribution  (see 
sketch).  The  four  components  (Xj^,X2tX3.X4) .  are  generated  essentially  as 
follows; 


V. 


X  B  u 

4  “3 

where  Uj^, 


(uniform) 

U2  (triangular) 

(uniform) 

u^  (triangular) 

U2.  Uj.  and  u^  are  independent  uniform  random  variables. 


I 


and  X2  are  dependent  and  x^  and  x^  are  dependent. 

The  formulas  are  identical  for  the  bivariate  simulations,  except  that 
only  the  first  two  components  are  used. 

Sample  sizes  vary  from  5  to  100.  The  tables  which  follow  are 
categorized  by  the  four  basic  assumptions  about  the  data  variance- 
covariance  (see  Page  14). 


Trianaular  Distribution 


Uniform  Distribution 


Triangular-uniform  Distribution,  Case  1 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  for  the  distribution  is  known,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

The  first  assumption  is  not  met.  The  data  follow  a  Triangular-uniform 
distribution,  not  the  Normal  distribution.  However,  the  second  and  third 
assumptions  are  met. 


Summary 


Probability  of  Type  I  error: 

Agrees  with  the  set  level  of  0.05. 


Probability  of  Type  II  error: 

Varies  between  0.85  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  variance-covariance.  Error  is 
greater  than  when  the  distribution  is  Normal. 


TABLE  C-1 


CASE  1 


TRIANGULAR-UNIFORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Samole  Size 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 


Percentage 

- 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  C-2 


Triangular-uniform  Distribution,  Case  2 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  estimated  from  the  data,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

The  first  assumption  is  not  met.  The  data  follow  a  Triangular-uniform 
distribution,  not  the  Normal  distribution.  However,  the  second  and  third 
assumptions  are  met. 


Summary 


Probability  of  Type  I  errors 

Agrees  with  the  set  level  of  0.05. 


Probability  of  Type  II  error: 

Varies  between  0.95  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  variance-covariance.  Error  is 
greater  than  when  the  data  follow  the  Normal  distribution. 


TABLE  C-  3 


CASE  2 


TRIANGULAR-UNIFORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

ScunDle  Size 


5 

10 

20 

50 

100 

95 

92 

96 

94 

95 

95 

78 

38 

2 

0 

89 

35 

0 

0 

0 

88 

2 

0 

0 

0 

83 

0 

0 

0 

0 

- 

- 

- 

- 

- 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  C-  5 


CASS  2 -Bivariate 


TR I ANGULAR-UNI FORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samnle  Size 


10 

20 

50 

100 

96 

94 

96 

96 

78 

50 

5 

0 

32 

2 

0 

0 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 


Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 
_ _ _ ..  _ ,  Percentaqe 


Estimated  probability  of  Type  I  Error  =  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  Error  = 
when  difference  in  location  is  not  0. 


Percentage 


-V  >-.>v  v;,  ■ .  y: 


75 


i 

Triangular-uniform  Distribution,  Case  3 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  the  location  is  estimated  by  the 

P  mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  a  Triangular- 
uniform  distribution,  not  the  Normal  distribution,  and  the  variance- 
covariance  matrix  used  in  the  test  is  estimated  from  the  data,  not  known. 
However,  the  third  assumption  is  met. 

? 

M  * 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.75  and  0.05  depending  mainly  on  sample  size.  Higher 
error  rates  occur  for  4-variate  data. 


I 


.•j 


S' 


'i 


i 


i 


Probability  of  Type  II  error: 

Varies  between  0.65  and  0.  Higher  error  rates  occur  for  bivariate 

data. 
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I 

V 

i 

s 

3 
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V 

% 
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LOCATION  OF  EMITTERS 


TABLE  C-6 


CASE  3 


TRIANGULAR- UNI FORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


5 

10 

25 

67 

16 

36 

5 

0 

.25 

0 

0 

0 

- 

- 

50 

100 

93 

95 

0 

0 

0 

0 

0 

0 

0 

0 

- 

- 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  C-7 


77 


E'  1  .35  0  0 

=  .35  .500 

0  0  .5  .35 

0  0  .35  1 


CASE  3 


TRIANGULAR- UNI FORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Scunole  Size 


5 

10 

20 

50 

100 

0.0 

30 

74 

88 

93 

mm 

0.5 

11 

21 

1.5 

0 

Hi 

1.0 

.5 

0 

0 

0 

H 

1.5 

0 

0 

0 

0 

to 

• 

o 

0 

0 

0 

0 

0 

2.5 

- 

- 

- 

- 

3.0 

— 

- 

— 

IB 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  ®  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  ® 
when  difference  in  location  is  not  0. 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  C~  8 


CASE 


-Bivariate 


TRIANGULAR- UNIFORM 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


SaniTDle  Size 


5 

10 

20 

50 

100 

72 

83 

93 

92 

96 

n 

65 

65 

44 

8 

0 

34 

15 

0 

0 

0 

B 

9 

0 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 
Estimated  probability  of  Type  I  Error  *  1- 


when  difference  in  location  is  0 . 

Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0 . 


Percent aae 

loo  " 
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Triangular-uniform  Distribution,  Case 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  knovm,  and  that  the  location  is  estimated  by 
the  mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  the  Triangular- 
uniform  distribution,  not  the  Normal  distribution,  and  the  variance- 
covariance  matrix  used  in  the  test  is  varied  from  the  true  by  a  percentage 
(-4$  to  4J).  This  could  be  thought  of  as  an  effect  of  calibration  error. 
However,  the  third  assumption  is  met. 


Summary 


Probability  of  Type  I  error: 

Varies  between  0.24  and  0  depending  on  the  magnitude  of  the  variance- 
covariance  error. 


Probability  of  Type  II  error: 

Varies  between  0.94  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  the  variance-covariance  error. 


in  Covariance  Matrix 


80 


table  C-9. 


CASE  4 


T'pTAMCnTAR^UNIFORM 


1.7  0 

7  10 

0  0  1 
0  0.7 


percentage  of  locations  accepted  as  the  same 

Sample  Size 


%  Change 


percentage 

A.  of  Type  I  error  =  1  loO 

Estimated  probabili  y  Percentage 

w  v,4i-!rv  of  Type  II  error  =  -  100 

Estimated  probability  of  lyp 

For  0%  change  see  Table  C-1. 


TABLE  C-  10 
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CASE  4 -Bivariate 


E- 


1  .7 

.7  1 


TRIANGULAR-UNIFORM 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Saunple  Size 


^C6n^dCT0 

Estimated -probability  of  Type  I  error  =  1-  — ^^00 - - 

Percentaoe 

Estimated  probability  of  Type  II  error  =  - ^ 


Diff .  in  Log. 

%  Change 

Of.  Emitter 

5 

10 

20 

50 

100 

0 

86 

84 

83 

86 

85 

-4% 

0.5 

67 

42 

22 

4 

0 

1.0 

24 

6 

0 

0 

0 

X 

2.0 

0 

0 

0 

0 

0 

-l-t 

u 

AJ 

0 

86 

88 

88 

88 

84 

nj 

Z 

-3% 

0.5 

1.0 

71 

32 

49 

9 

24 

0 

2 

0 

0 

0 

0) 

u 

2.0 

0 

0 

0 

0 

0 

c 

0 

90 

94 

92 

■KnH 

•H 

-2% 

0.5 

75 

60 

30 

4 

u 

(Q 

1.0 

37 

7 

0 

0 

> 

0 

2.0 

0 

0 

0 

0 

BB 

u 

0 

97 

100 

97 

98 

98 

c 

2% 

0.5 

86 

76 

50 

10 

0 

•H 

1.0 

52 

20 

0 

0 

0 

0) 

cr 

c 

ns 

2.0 

2 

0 

0 

0 

0 

0 

99 

98 

100 

98 

98 

£ 

U 

3% 

0.5 

94 

82 

52 

11 

0 

1.0 

60 

22 

2 

0 

0 

dP 

2.0 

1 

0 

0 

0 

0 

0 

100 

99 

98 

98 

4% 

0.5 

92 

78 

60 

16 

■El 

1.0 

61 

29 

2 

0 

2.0 

1 

0 

0 

0 
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APPENDIX  D 


Gamma  Distribution 


In  the  simulation  results  which  follow,  the  test  distribution  is  the 
Gamma  distribution. 

The  Gamma  distribution  is  a  skewed  distribution  whose  mean  (expected) 
value  is  to  the  right  of  the  mode  (most  frequent)  value. 


The  density  sketched  above  follows  the  general  equation 


f(x) 


Three  types  of  this  density  were  simulated  in  the  4-variate  case: 

a)  n-2,  coefficient  of  skewness  •  1.414. 

b)  n«4,  coefficient  of  skewness  -  1.000. 

c)  n-9,  coefficient  of  skewness  -  0.666. 

(Note  that  the  coefficient  of  skewness  for  symmetric  distributions  such  as 
Normal,  Gillis,  or  Triangular-uniform  is  zero).  As  n  increases  the  Gamma 
distribution  approaches  the  Normal  distribution. 


These 


The  data  are  vectors  with  four  components 
components  are  generated  as  follows: 


*1- Vi 

X2  -  >  a3y2 


*3  ’  ^^3 

*A  -  ^5^3  *  ^6^4 


where  y^,  yu  .  y^  and  y^  are  independent  gamma  variables  and  the  ai's  are 
constants.  TFor  each  simulation  case  the  y*s  were  generated  with  the  same 
n  value  but  with  varying  lambda  values).  For  bivariate  cases,  only  the 
first  two  components  were  used. 

Sample  sizes  vary  from  5  to  100.  The  tables  which  follow  are 
categorized  by  the  four  basic  assumptions  about  the  variance-covariance 
(see  Page  14  ). 


Gamma  Distribution,  Case  1 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  for  the  distribution  is  known,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

The  first  assumption  is  not  met.  The  data  follow  a  Gamma 
distribution,  not  the  Normal  distribution.  However,  the  second  and  third 
assumptions  are  met. 


Summary 


Probability  of  Type  I  error; 

Agrees  with  the  set  level  of  0.05. 


Probability  of  Type  II  error; 

Varies  between  0.76  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  variance-covariance.  Error  is 
greater  than  when  the  distribution  is  Normal. 


TABLE  D-1 


'1,1  0  0 
.7100 
0  0  1  .7 

0  0  .7  1 


CASE  1 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  *  1- 
when  difference  in  location  is  0. 


Percentage 
- 100 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 
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5 

10 

20 

50 

100 

97 

94 

96 

97 

96 

67 

56 

14 

2 

0 

15 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

on  1 
irec 

f  si 


K 


[iS] 


87 


i 


Gamma  Distribution,  Case  2 


U  f, 

K'' 


fr: 


Sr 


r.- 

Ill 


R 

hi. 


rr: 

R 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  estimated  from  the  data,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 


The  first  assumption  is  not  met. 
distribution,  not  the  Normal  distribution, 
assumptions  are  met. 


The  data  follow  a  Gamma 
However,  the  second  and  third 


Probability 

of  Type  I  error; 

r^\ 

u  ** 

Varies 

from  0.12  to  0.03 

1  w* 

£ 

the  highest 

error  rate. 

Probability 

of  Type  II  error; 

Summary 


Small  sample  sizes  for  bivariate  data  have 


Varies  between  0.92  and  0  depending  mainly  on  sample  size,  assumed 
difference  in  location  of  emitters  and  magnitude  of  variance-covariance. 
Error  is  greater  than  when  the  data  follow  the  Normal  distribution. 


ci 


DIFFERENCE  IN  LOCATION 


TABLE  D-3 


CASE  2 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SeunDle  Size 


5 

10 

94 

90 

92 

72 

90 

49 

85 

24 

85 

6.5 

78 

0 

72 

0 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

j  4 1  i  ^  ^  Percentace 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


FERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  D-4 


1  .7 
.7  1 
0  0 
0  0 


CASE  2 


GAMMA  (n  =  4) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Saznisle  Size 


5 

10 

20 

50 

100 

mm 

95 

92 

92 

92 

95 

D 

92 

64 

34 

4 

0 

90 

18 

0 

0 

0 

B 

86 

1 

0 

0 

0 

2.0 

78 

0 

0 

0 

0 

2.5 

- 

- 

- 

- 

- 

3.0 

- 

- 

- 

- 

- 

b 

H 
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Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


100 


TABLE  D-5 


0 

0 

1 

.7 


CASE  2 


GAMMA  (  n  =  9) 


cn 

K 

U 

E- 

M 

z 

u 

b 

O 

z 

o 

M 

< 

u 

s 


E:] 

o 

z 

u 

z 

SQ 

b4 

b 

H 
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PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


5 

10 

20 

50 

100 

■Ml 

97 

93 

97 

92 

- 

n 

90 

60 

39 

5 

0 

89 

16 

0 

0 

0 

85 

4 

0 

0 

0 

2.0 

80 

0 

0 

0 

0 

2.5 

- 

- 

- 

- 

- 

3.0 

- 

— 

— 

— 

— 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  <5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1  - 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  * 
when  difference  in  location  is  not  0. 


91 


TABLE  D-6 


E"  1  .35  0  0 

_  .35  .50  0  0 

-  0  0  .5  .35 

0  0  .35  .5 


CASS  2 


GAMMA  (n  =  9) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sair.r>le  Size 


10 

20 

50 

100 

94 

93 

96 

96 

42 

6 

0 

0 

3.5 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

- 

- 

- 

- 

Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  *  1  - 
when  difference  in  location  is  0. 


Percentage 

loo 


Estimated  probability  of  Type  II  error 
when  difference  in  location  is  not  0. 


Percentage 

100 


LOCATION  OF  EMITTERS 


TABLE  D-7 


i-[: 


CASE  2 -Bivariate 


GAMMA  (n=3) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 


Difference  in  location  is  the  same  for  each  "directxon"r 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Seunple  Size  5 


Estimated  probability  of  Type  I  Error  =  1- 

when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error  »  — 
when  difference  in  location  is  not  0. 


Percentage 


Percentage 

TUB 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  D-8 


Z- 


5  2 
2  1 


CASE  2 -Bivariate 


GAMMA  (n=2) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SaiRDle  Size 


20 

50 

92 

93 

2 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  In  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  rxui  for  Sarnie  Size  5 
Estimated  probability  of  Type  I  Error  ■  1- 
when  difference  in  location  is  0. 


Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0. 


Percentage 
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Gamma  Distribution,  Case  3 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  the  location  is  estimated  by  the 
mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  a  Gamma 
distribution,  not  the  Normal  distribution,  and  the  variance-covariance 
matrix  used  in  the  test  is  estimated  from  the  data,  not  known.  However, 
the  third  assumption  is  met. 


Summary 


Probability  of  Type  I  error: 


Varies  between  0.80  and  0.06  depending  on  sample  size, 
rates  occur  for  4-variate  data. 


Higher  error 


Probability  of  Type  II  error: 

Varies  between  0.42  and  0  depending  mainly  on  sample  size,  assumed 
difference  in  location  of  emitters  and  magnitude  of  variance-covariance. 
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TABLE  D-9 


10  0“ 
4  0  0 

0  4  1 

0  14 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 
m 

SaiTiTsle  Size 


M 

5 

10 

20 

50 

100 

s: 

0.0 

20 

64 

82 

88 

94 

O 

2 

0.5 

13 

39 

40 

22 

6 

O 

M 

1.0 

6 

17 

9 

0 

0 

< 

CJ 

Q 

1.5 

4 

3 

1 

0 

0 

z 

M 

2.0 

1 

0 

0 

0 

0 

Ed 

o 

3.0 

0 

0 

0 

0 

0 

z 

r-T 

E 

4.0 

0 

0 

0 

0 
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TABLE  D-11 


GAMMA  (n  =  9) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SsinDle  Size 


5 

10 

24 

64 

14 

28 

5 

2.5 

.75 

0 

.25 

0 

- 

- 

FERFNCK  IN  LOCATION  OF  EMITTERS 


TABLE  D-12 


I-f1 


i3 


CASES 


GAMMA  (n  =  4) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


5 

10 

26 

62 

4 

16 

.5 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction”, 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1  - 


when  difference  in  location  is  0 


Estimated  probability  of  Type  II  error  = 
when  difference  in  location  is  not  0. 


TABLE  D-13 


E- 


CASE  3-Bivariate 


GAMMA  (n=3) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samnle  Size 


5 

10 

20 

50 

100 

65 

87 

90 

92 

90 

41 

29 

20 

0 

0 

14 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5 
Estimated  probability  of  Type  I  Error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error  =  _£ercentaae 
when  difference  in  location  is  not  0 . 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


CASS 


:ABLE  D~14 

E- 


3-Bi variate 


GAMMA  (n=2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


SaitiDle  Size 


5 

10 

20 

50 

100 

mm 

72 

86 

89 

90 

94 

D 

42 

38 

20 

1 

0 

14 

4 

0 

0 

0 

B 

3 

0 

0 

0 

0 

2.0 

2 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Difference  in  location  is  the  same  for  each  "direction", 
i.e.  >5  is  in  each  direction. 

Note  that  two  sets  of  simulations  were  run  for  Sample  Size  5. 
Estimated  probability  of  Type  I  Error  =  1- 


when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  Error 
when  difference  in  location  is  not  0 . 


Percentaae 

TUB 
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Gamma  Distribution,  Case  4 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  that  the  location  is  estimated  by 
the  mean  of  the  data. 

The  first  two  assumptions  are  not  met.  The  data  follow  the  Gamma 
distribution,  not  the  Normal  distribution,  and  the  variance-covariance 
matrix  used  in  the  test  is  varied  from  the  true  by  a  percentage  (-4jt  to 
4X).  This  could  be  thought  of  as  an  effect  of  calibration  error.  However, 
the  third  assumption  is  met. 


Summary 


Probability  of  Type  I  errors 


Varies  between  0.24  and  0.02  depending  on  the  magnitude  of  the 
variance-covariance  error. 


Probability  of  Type  II  error: 

Varies  between  0.96  and  0  depending  on  sample  size,  assumed  difference 
in  location  of  emitters  and  magnitude  of  the  variance-covariance  error. 


Change  in  Covariance  Matrix 


TABLE  D-15 


CASE  4 


E- 


GAMMA  (n  =  2) 


1  .7  0  O" 

7  10  0 

0  0  1  .7 

0  0  .7  1 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


%  Cheuige 


Oiff.  in  Loc, 
Of.  Emitter 


76 

79 

2 

0 

Estimated  probability  of  Type  I  error  =  1- 
Estimated  probability  of  Type  II  error  = 
For  0%  Change  see  Table  D-1  results. 


Change  in  Covariance  Matrix 


•J 


TABLE  D-16 


CASE  4 -Bivariate 


Z- 


1  0 
0  1 


GAMMA  (n=3) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


Diff.  in  Log. 
%  Change  Of.  Emitter 


4% 


0 

1 

2 


O  O  O  oolo  O  O  00 
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APPENDIX  E 


Gamma  Mode  Distribution 


In  the  simulation  results  which  follow  the  test  distribution  is  the 
Gamma  distribution. 

The  Gamma  distribution  is  a  skewed  distribution  whose  mean  (expected) 
value  is  to  the  right  of  the  mode  (most  frequent)  value.  For  the 
simulation  cases  which  follow  the  location  was  estimated  by  the  mode  (most 
frequent)  value  rather  than  the  mean  (expected)  value  of  the  data. 


The  density  sketched  above  follows  the  general  equation 


f(x)  - 


^  j*'*'  •'^\'***  ••  **'>  ”  J*  ">  V 
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The  data  are  vectors  with  four  con?>onents  (x. ,x_,x.,x, ). 

These  components  are  generated  as  follows: 

*1-  *1/1 

X2  -  a2yi  ♦  a3y2 

*3  -  a^ys 

"4  -  ^^3  "  ^6^4 

where  y^^  ,  y^ ,  yj  and  y^  are  independent  gamma  variables  and  the  *s  are 
constants.  TFor  each  simulation  case  the  y's  were  generated  with  the  same 
n  value  but  with  varying  lambda  values).  For  the  bivariate  case,  only  the 
first  two  components  were  used. 

Sample  sizes  vary  from  5  to  100.  The  tables  which  follow  are 
categorized  by  the  four  basic  assumptions  about  the  variance-covariance 
(see  Page  14  ). 


Gamma  Mode  Distribution,  Case  1 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  for  the  distribution  is  known,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

The  first  and  third  assumptions  are  not  met.  The  data  follow  a  Gamma 
distribution,  not  the  Normal  distribution.  Also,  the  location  is  estimated 
by  the  mode  of  the  data.  However,  the  second  assumption  is  met. 


Summary 


Probability  of  Type  I  error: 

Varies  between  1.0  and  0.88  depending  on  sample  size.  The  larger  the 
sample  size  the  larger  the  probability  of  error. 


Probability  of  Type  II  error; 


Almost  0 


TABLE  E-1 


V  J.  0 

\  =  0  1 
/  0  0 

L — I  0  0 


CASE  1 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 


Mean:  (1.414,  1.414,  1.414,  1.414) 
Mode:  (0.707,  0.707,  0.707,  0.707) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0 . 


Percentage 

ruu 


Estimated  probability  of  Type  II  error 


Percentaae 


rMTi 


TABLE  E-2 


Z- 


CASE  1-Bivariate 


GAMMA  (n=3) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

SamTsle  Size 


Gamma  Mode  Distribution,  Case  2 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  estimated  from  the  data,  and  that  the 
location  is  estimated  by  the  mean  of  the  data. 

The  first  and  third  assunptions  are  not  met.  The  data  follow  a  Gamma 
distribution,  not  the  Normal  distribution.  Also,  the  location  is  estimated 
by  the  mode  of  the  data.  However,  the  second  assumption  is  met. 


Summary 


Probability  of  Type  I  error; 

Varies  between  1.0  and  0.09  depending  on  sample  size.  The  larger  the 
sample  size  the  larger  the  probability  of  error. 


Probability  of  Type  II  error: 

Varies  between  0.88  for  small  sample  size  and  0  for  larger  sample 
size.  Also  depends  on  assumed  difference  in  location  of  emitters.  Error 
is  less  than  when  the  distribution  is  Normal. 


TABLE  E-3 


Ei  a 
=  0  1 
0  0 
0  0 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 


Mean;  (1.414,  1.414,  1.414,  1.414) 
Mode;  (0.707,  0.707,  0.707,  0.707) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 


Percentaqe 


Estimated  probability  of  Type  II  error  = 


Percentage 


TABLE  E-4 


El  0  0 
_  0  2  0 
“003 
0  0  0 


GAMMA  (n  =  3) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Sample  Size 


5 

10 

20 

92 

42 

5 

88 

7 

0 

87 

0.5 

0 

78 

0 

0 

74 

0 

0 

73 

0 

0 

65 

0 

0 

Mean:  (1.732,  2.449,  3.000,  3.464) 
Mode:  (1.155,  1.633,  2.000,  2.309) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 


when  difference  in  location  is  0. 
Estimated  probability  of  Type  II  error  = 


Percentage 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  E-5 


CASE 


0 

1 


-Bivariate 


GAMMA  (n=3) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samole  Size 


Mean:  (1.732,  1.732) 
Mode;  (1.155,  1.155) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  = 
when  difference  in  location  is  0 . 

Estimated  probability  of  Type  II  error  = 


,  Pei centaae 


Percentaae 

loo 


Gamma  Mode  Distribution,  Case  3 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  the  location  is  estimated  by  the 
mean  of  the  data. 

None  of  the  assumptions  are  met.  The  data  follow  a  Gamma 
distribution,  not  the  Normal  distribution.  The  variance-covariance  matrix 
used  in  the  test  is  estimated  from  the  data,  not  known.  The  location  is 
estimated  by  the  mode  of  the  data. 


Summary 


Probability  of  Type  I  error; 

Varies  between  1.0  and  0.93  depending  on  sample  size, 
sample  size  the  larger  the  probability  of  error. 


The  larger  the 


Probability  of  Type  II  error: 


Almost  0. 


m 

% 

S' 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


5 

10 

20 

50 

100 

0.0 

4 

3 

0.4 

0 

0 

0.5 

0.1 

0 

0 

0 

0 

1.0 

0 

0 

0 

0 

0 

1.5 

0 

0 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Mean:  (1.414,  1.414,  1.414,  1.414) 
Mode;  (0.707,  0.707,  0.707,  0.707) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  =  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  =  — 


FERENCE  IN  LOCATION  OF  EMITTERS 


TABLE  E-  7 


El  0  0  0 

_  0  2  0  0 

”  0  0  3  0 

0  0  0  4 


GAMMA  (n  =  3) 

PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 

Samcle  Size  _ 


Mean;  (1.732,  2.449,  3.000,  3.464) 
Mode;  (1.155,  1.633,  2.000,  2.309) 


Difference  in  location  is  the  same  for  each  "direction", 
i.e.  .5  is  in  each  direction. 


Estimated  probability  of  Type  I  error  =  1- 
when  differ-ence  in  location  is  0. 


Percentaoe 

no’s  ^ 


Estimated  probability  of  Type  II  error 


Percentage 

— rso 


DIFFERENCE  IN  LOCATION  OF  EMITTERS 


CASE  3-Bi variate 


GAMMA  (n=2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Samole  Size 


5 

10 

20 

50 

100 

o 

• 

o 

44 

32 

14 

0 

0 

0.5 

9 

2 

0 

0 

0 

1.0 

1 

0 

0 

0 

0 

1.5 

0 

0 

0 

0 

0 

2.0 

0 

0 

0 

0 

0 

2.5 

0 

0 

0 

0 

0 

3.0 

0 

0 

0 

0 

0 

Mean;  (1.732,  1.732) 
Mode;  (1.155,  1.155) 


Difference  in  location  is  the  same  for  each  "direction 
i.e.  .5  is  in  each  direction. 

Estimated  probability  of  Type  I  error  «  1- 
when  difference  in  location  is  0. 

Estimated  probability  of  Type  II  error  «  Percenta2e 


Gamma  Mode  Distribution,  Case  *4 


e 


i 


§ 


I 


The  test  assumes  the  underlying  distribution  is  Normal,  that  the 
variance-covariance  matrix  is  known,  and  that  the  location  is  estimated  by 
the  mean  of  the  data. 

None  of  the  assumptions  are  met.  The  data  follow  the  Gamma 
distribution,  not  the  Normal  distribution,  and  the  variance-covariance 
matrix  used  in  the  test  is  varied  from  the  true  by  a  percentage  (-4X  to 
4J).  This  could  be  thought  of  as  an  effect  of  calibration  error.  The 
location  is  estimated  by  the  mode  of  the  data. 


Summary 


Probability  of  Type  I  error: 

Varies  between  1.0  and  0.76  depending  on  sample  size  and  the  magnitude 
of  the  variance-covariance  error.  The  larger  the  sample  size  the  larger 
the  probability  of  type  I  error. 


Probability  of  Type  II  error: 
From  0.24  to  0. 


P 

i 

i 


Change  in  Covariance  Matrix 


TABLE  E-  9 


CASE  4 


E- 


GAMMA  (n  =  2) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Samole  Size 


Oiff.  in  Loc. 
%  Change  Of. Emitter 


Estimated  probability  of  Type  I  error  » 
Estimated  probability  of  Type  II  error 


Percentaae 

100 

Percentaae 

100 


For  0%  change  see  Table  E-1. 

Mean:  (1.414,  1.414,  1.414,  1.414) 
Mode:  (0.707,  0.707,  0.707,  0.707) 


,x  I 


Change  in  Covariance  Matrix 


TABLE  E-10 
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CASE  4 


El  0  0  0 
0  2  0  0 

=  0  0  3  0 

0  0  0  4 


GAMMA  (n  -  3) 


PERCENTAGE  OF  LOCATIONS  ACCEPTED  AS  THE  SAME 


Sample  Size 


Di££.  in  Loc. 
%  Change  0£.  Emitter 


Estimated  probability  o£  Type  I  error  *  1- 

Estimated  prpbability  o£  Type  II  error  «  - 

Mean;  (1.732,  2.449,  3.000,  3.464) 

Mode;  (1.155,  1.633,  2.000,  2.309) 


Percentaae 
100 

Percentaae 

100 


*.»  Si 
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TABLE  E-11 


CASE  4 -Bivariate 


I 


E- 


APPENDIX  F 


CASE  1 


Mathematical  Tests 


“  Eq  Z  known 


^1=  E  Ho 


The  data  are  •••/  ^  i  =  1,  ...,  n. 

For  this  report  p  =  2  or  4.  Ihe  data  are  assuned  to  be  independ¬ 
ent  observations  for  a  p-variate  (4-variate)  normal  distribution 
with  mean  y  =  (y, ,  y- ,  .../  y  )  and  the  variance- covariance  matrix 

-  1  2  p 


2 


Let  a  be  the  level  of  significeuice  of  the  test  that  is,  the 
probability  we  reject  when  it  is  true. 


CASE  1 


We  estimate  y  by 


x=  i 

—  n 


n 

1=1 


n 


n 

Lx. 

1 


2  —  -1  —  T  2 

The  test  is  to  reject  Hq  if  T  =  n  (x  -  y^)L  ”  Ho^  ^ 

2 

where  Xp  to)  is  the  100  (1-a)  percentile  point  of  a  chi-square 
disbribution  with  p  degrees  of  freedom*. 


2 

When  y.  y^#  T  is  distributed  as  a  non-central  chi-square  with 
p  degrees  of  freedom  and  non-centrality  parameter 

n  (y  -  Wq)  (y  -  ilQ)\ 


2 

*In  this  case  the  test  statistic  T  is  called  Hotellings 
2 

T  statistic. 


CASE  2 


Hq:  y  *  Uq  fixed,  Z  iinknown 

]Lo 


The  data  follow  the  same  assiunptlons  as  for  Case  1  except 
that  Z  is  xinknown.  We  estimate  Z  by  the  matrix 


S 


1 

n-1 


n 

Z 

i=l 


X  ) 


T 


2  —  —1  —  T  2 

The  test  is  to  reject  if  T  =  n  (x  -  S  (x  ”  Tq 

where  T  ^  _  p  ,  . 

wnere  1q  j^_p  p  p,n-p  (a). 

F  (a)  is  the  100  (1-a)  percentile  point  of  an  F  distribution 

P*n"P 

with  p,n-p  degrees  of  freedom. 

2 

When  w  =  Uq/  T  is  distributed  as  non-central  F  distribution. 


For  details  see  Anderson,  T.W.,  An  Introduction  to  Multivariate 
Statistical  Analysis.  New  York:  John  Wiley  &  Sons,  Inc.  1958, 


CASE  3 


^0*  H.  “  Eq  Assume  L  =  S 

E  Ho 

The  data  follow  the  seune  assumptions  as  for  Case  1  except 
that  we  assume  our  estimate  of  1,  S,  is  in  fact  equal  to  Z. 

The  test  decision  is  to  reject  Hq  if 

T^  =  n  (x  -  S”^  (x  -  ^  Xp  (a) 

where  Xp  (a)  is  the  100  (1-a)  percentile  point  of  a  chi-square 

distribution  with  p  degrees  of  freedom,  a  is  the  level  of 
significance  (probability  of  rejection  Hq  when  it  is  true) . 

In  this  case  the  assumption  that  Z  =  S  will  not  be  valid 
for  small  Seunple  sizes  2m,d  the  probability  of  rejecting  H.  when 
it  is  true  will  be  much  greater  than  oi. 


0 

E 

I 


CASE  4 
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^0 '  ii.  ”  Ho  ^  assumed  known 


^1=  H  Ho 


The  data  follow  the  same  underlying  assumptions  as  for 
Case  1  except  that  our  assvuned  known  Z  is  ■inoorveat.  The  correct 
variance-covariance  matrix  is  aZ. 


The  test  is  to  reject  Hq  if 


T^  =  n  (X  -  Z"^  (X  -  X^P  (a) 


where  x  p  (d)  is  the  100  (l>-a)  percentile  point  of  a  chi-square 

distribution  with  p  degrees  of  freedom,  a  is  the  level  of  signifi¬ 
cance  (probability  of  rejecting  Hq  when  it  is  true) . 


In  this  case  the  is  incorrect.  If  we  knew  "a”  it  would 
be  correct  to  use  the  T^  below  as  the  test  statistic.  Reject  if 


T^  =  n  (x  -  ^  )  (aZ)“  (x  -  ax  p  (a) 


vv  . 


m 


APPENDIX  G 


Simulation  Procedures 


The  test  error  frequency  values  cited  in  the  previous  appendices  were 
generated  through  a  simulation  technique.  More  explicitly,  a  VAX-n/780 
mini-computer  located  at  Claremont  McKenna  College  was  used  to  generate 
observations  from  various  distributions  (Normal,  Gillis,  etc.),  and  the 
tests  were  repeated  200  to  500  times.  The  figures  listed  in  the  tables  are 
the  percentage  of  the  time  the  tests  accepted.  For  example,  if  for  a  given 
case  the  test  accepted  190  out  of  200  times,  the  reported  figure  would  be 
95?. 


The  simulations  were  conducted  in  the  following  manner.  First,  a  case 
was  chosen : 

1 )  The  location  and  parametrics  of  the  emitter  in  the  database  were 
specified. 

2)  The  distribution  of  the  incoming  data  was  specified  (Normal, 
Gillis,  etc.). 

3)  The  test  to  be  used  was  specified,  corresponding  to  the  four 
cases  listed  on  page  14. 

^)  The  parameters  of  the  data  distribution  were  specified;  for 

example,  the  mean  and  variance-covariance  for  the  Normal.  In 
order  to  determine  the  probability  of  type  II  error,  the  data 
were  frequently  generated  around  a  mean  point  other  than  the  true 
location  of  the  emitter. 

5)  The  sample  size  (number  of  data  observations)  was  specified. 


Then,  for  this  particular  case,  the  following  steps  were  performed  200 
to  500  times; 


1)  Generate  n  observations  from  the  given  distribution  (where  n  is 
the  sample  size). 

2)  Conduct  th9  chosen  test. 

3)  Record  acceptance  or  rejection. 


127 


APPENDIX  H 


RELATING  ELLIPSE  SHAPE  WITH  MEASUREMENT  ERROR 


This  appendix  explains  how  the  shape  of  a  confidence  ellipse  is  related 
to  the  associated  covariance  matrix.  Note  that  the  geometrical  analysis  of 
this  appendix  applies  primarily  to  the  bivariate  simulations. 

The  covariance  matrices  I  used  in  this  report  measure  the  error  in  the 
individual  observations.  The  coveU'iance  matrix  vrtiich  reflects  the  combination 
of  this  data  is  I/n,  vrtiere  n  is  sample  size.  The  equation  of  the  95% 
confidence  ellipse  is 

(3^- ji)T(z/n)-1(2^- _u)  -  xi(.95)  •  5.991 

Increased  sample  size,  n,  decreases  the  amount  of  measurement  error.  The 
ultimate  effect  of  this  is  to  decrease  the  size  of  the  confidence  ellipse. 

When  sample  size  is  not  mentioned  in  this  report  assume  that  n-1 . 

Let  A  be  the  variance  in  the  east-west  (x)  direction  and  C  be  the 
variance  in  the  north-south  (y)  direction.  The  covariance  is  B.  Then  the 
covariance  matrix  is  as  follows: 


Given  this  matrix  the  lengths  of  the  semi-major  and  semi-minor  axes  are 
given  by:  _ 

Length  of  Semi-Major  Axis  -  >/  5.991  •  (  A+CV(a-C) )  /(2n) 

Length  of  Semi-Minor  Axis  -  >/  5.991  *  (  A+C-/(A-C) *+^B*  )  /(2n) 

The  angle  ip  (measured  clockwise  from  North)  of  the  Semi-Major  Axis  is 
e  -  90*-[Arcsin(2B//(A-C) *-*-HB* ) ]/2  if  ASC 
or  6  -  [ArcslnC 2B//( A-C ) **MB* ) ]/2  if  A<C 

Applying  this  to  the  matrices  referenced  in  this  report: 

For  r  ■  (0*3  0*3^  (used  without  reference  to  n,  i.e.  n-1 ) 

We  have:  Angle  to  the  Semi-Major  Axis,  0-85.^* 

Semi-Major  Axis  -  4.91 
Semi-Minor  Axis  -  1 .29 


For  Z  -  0.3l 

^  ^0.3  2.0^ 

We  have:  Angle  to  the  Semi-Major  Axis,  0-9.22® 
Semi-Major  Axis  -  3*50 
Semi-Minor  Axis  -.95 


(used  without  reference  to  n,  i.e.  n-1 ) 


! 


For  I  -  0*15^  (used  without  reference  to  n,  l.e.  n-1 

We  have:  Angle  to  the  Semi-Major  Axis,  0-85.4* 

Semi-Minor  Axis  -  3.47 
Semi-Major  Axis  -  .91 

For  I  -  (Jj  0) 

We  have:  Angle  to  the  Semi-Major  Axis,  NONE  -  it  is  circular 
Lengths  of  axes  depend  on  sample  size,  n,  as  follows 


iple  Size 

Semi-Major  Axis 

Semi-Minor  Axis 

5 

1 .09 

1 .09 

10 

.77 

:77 

20 

.55 

.55 

50 

.37 

^37 

100 

.24 

.24 

For  r  -  (| 

We  have:  Angle  to  the  Semi-Major  Axis,  0-67.5 

Lengths  of  axes  depend  on  san^le  size,  n,  as  follows 


Sample  Size 

Semi-Major  Axis 

Semi-Minor  Axis 

5 

2.64 

.45 

10 

1.87 

.32 

20 

1.32 

.23 

50 

.83 

.14 

100 

.59 

.10 

^  ^  “  ^0i7  1;0^ 

We  have;  Angle  to  the  Semi -Major  Axis,  0-45* 

Lengths  of  the  axes  depend  on  sample  size,  n,  as  follows 

Sample  Size  Semi-Major  Axis  Semi-Minor  Axis 


